Picture for Hao Zhang

Hao Zhang

University of Massachusetts Amherst

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Add code
Sep 17, 2024
Viaarxiv icon

Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering

Add code
Sep 11, 2024
Viaarxiv icon

DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement

Add code
Sep 10, 2024
Viaarxiv icon

Efficient LLM Scheduling by Learning to Rank

Add code
Aug 28, 2024
Viaarxiv icon

Camouflaged_Object_Tracking__A_Benchmark

Add code
Aug 25, 2024
Viaarxiv icon

Through-the-Wall Radar Human Activity Micro-Doppler Signature Representation Method Based on Joint Boulic-Sinusoidal Pendulum Model

Add code
Aug 22, 2024
Viaarxiv icon

Collaborative Cross-modal Fusion with Large Language Model for Recommendation

Add code
Aug 16, 2024
Viaarxiv icon

Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems

Add code
Aug 14, 2024
Figure 1 for Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems
Viaarxiv icon

MPC-Minimized Secure LLM Inference

Add code
Aug 07, 2024
Figure 1 for MPC-Minimized Secure LLM Inference
Figure 2 for MPC-Minimized Secure LLM Inference
Figure 3 for MPC-Minimized Secure LLM Inference
Figure 4 for MPC-Minimized Secure LLM Inference
Viaarxiv icon

LLaVA-OneVision: Easy Visual Task Transfer

Add code
Aug 06, 2024
Viaarxiv icon